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SEQUENCE LISTING 



SEQ ID N0:1 

EBOGPl /MBGGP2 nucleotide sequence 



10 20 30 40 50 

atgggcgtta caggaatatt gcagttacct cgtgatcgat tcaagaggac 

60 70 80 90 100 

atcattcttt ctttgggtaa ttatcctttt ccaaagaaca ttttccatcc 

110 120 130 140 150 

cacttggagt catccacaat agcacattac aggttagtga tgtcgacaaa 

160 170 180 190 200 

ctagtttgtc gtgacaaact gtcatccaca aatcaattga gatcagttgg 

210 220 230 240 250 

actgaatctc gaagggaatg gagtggcaac tgacgtgcca tctgcaacta 

260 270 280 290 300 

aaagatgggg cttcaggtcc ggtgtcccac caaaggtggt caattatgaa 

310 320 330 340 350 

gctggtgaat gggctgaaaa ctgctacaat cttgaaatca aaaaacctga 

360 370 380 390 400 

cgggagtgag tgtctaccag cagcgccaga cgggattcgg ggcttccccc 

410 420 430 440 450 

ggtgccggta tgtgcacaaa gtatcaggaa cgggaccgtg tgccggagac 

460 470 480 490 500 

tttgccttcc ataaagaggg tgctttcttc ctgtatgatc gacttgcttc 

510 520 530 540 550 

cacagttatc taccgaggaa cgactttcgc tgaaggtgtc gttgcatttc 

560 570 580 590 600 

tgatactgcc ccaagctaag aaggacttct tcagctcaca ccccttgaga 

610 620 630 640 650 

gagccggtca atgcaacgga ggacccgtct agtggctact attctaccac 

660 670 680 690 700 

aattagatat caggctaccg gttttggaac caatgagaca gagtacttgt 

710 720 730 740 750 

tcgaggttga caatttgacc tacgtccaac ttgaatcaag attcacacca 



760 770 780 790 800 

cagtttctgc tccagctgaa tgagacaata tatacaagtg ggaaaaggag 
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810 820 830 840 850 

caataccacg ggaaaactaa tttggaaggt caaccccgaa attgatacaa 

860 870 880 890 900 

caatcgggga gtgggccttc tgggaaacta aaaaaaacct cactagaaaa 

910 920 930 940 950 

attcgcagtg aagagttgtc tttcacagtt gtatcaaacg gagccaaaaa 

960 970 980 990 1000 

catcagtggt cagagtccgg cgcgaacttc ttccgaccca gggaccaaca 

1010 1020 1030 1040 1050 

caacaactga agaccacaaa atcatggctt cagaaaattc ctctgcaatg 

1060 1070 1080 1090 1100 

gttcaagtgc acagtcaagg aagggaagct gcagtgtcgc atctaacaac 

1110 1120 1130 1140 1150 

ccttgccaca atctccacga gtccccaatc cctcacaacc aaaccaggtc 

1160 1170 1180 1190 1200 

cggacaacag cacccataat acacccgtgt ataaacttga catctctgag 

1210 1220 1230 1240 1250 

gcaactcaag ttgaacaaca tcaccgcaga acagacaacg acagcacagc 

1260 1270 1280 1290 1300 

ctccgacact ccctctgcca cgaccgcagc cggaccccca aaagcagaga 

1310 1320 1330 1340 1350 

acaccaacac gagcaagagc actgacttcc tggaccccgc caccacaaca 

1360 1370 1380 1390 1400 

agtccccaaa accacagcga gaccgctggc aacaacaaca ctcatcacca 

1410 1420 1430 1440 1450 

agataccgga gaagagagtg ccagcagcgg gaagctaggc ttaattacca 

1460 1470 1480 1490 1500 

atactattgc tggagtcgca ggactgatca caggcgggag aagaactcga 

1510 1520 1530 1540 1550 

cgatcgatcc tctggaggga aggcgacatg ttcccttttc tggatgggtt 

1560 1570 1580 1590 1600 

aataaatgct ccaattgatt ttgacccagt tccaaataca aaaacaatct 

1610 1620 1630 1640 1650 

ttgatgaatc ctctagttct ggtgcctcgg ctgaggaaga tcaacatgcc 

1660 1670 1680 1690 1700 

tcccccaata ttagtttaac tttatcttat tttcctaata taaatgagaa 
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1710 


1720 


1730 


1740 


1750 


cactgcctac 


tctggagaaa 


atgagaatga 


ttgtgatgca 


gagttaagaa 


1760 


1770 


1780 


1790 


1800 


tttggagcgt 


tcaggaggat 


gacctggccg 


cagggctcag 


ttggataccg 


1810 


1820 


1830 


1840 


1850 


ttttttggcc 


ctggaattga 


aggactttac 


actgctgttt 


taattaaaaa 


1860 


1870 


1880 


1890 


1900 


tcaaaacaat 


ttggtctgca 


ggttgaggcg 


tctagccaat 


caaactgcca 


1910 


1920 


1930 


1940 


1950 


aatccttgga 


actcttattg 


agagtcacaa 


ctgaggaaag 


aacattctcc 


1960 


1970 


1980 


1990 


2000 


ttaatcaata 


gacatgctat 


tgactttcta 


ctcacaagat 


ggggaggaac 


2010 


2020 


2030 


2040 


2050 


atgcaaagtg 


cttggacctg 


attgttgcat 


cgggatagaa 


gacttgtcca 


2060 


2070 


2080 


2090 


2100 


aaaatatttc 


agagcaaatt 


gaccaaatta 


aaaaggacga 


acaaaaagag 


2110 


2120 


2130 


2140 


2150 


gggactggtt 


ggggtctggg 


tggtaaatgg 


tggacatccg 


actggggtgt 


2160 


2170 


2180 


2190 


2200 


tcttactaac 


ttgggcattt 


tgctactatt 


atccatagct 


gtcttgattg 


2210 


2220 


2230 


2240 


2250 


ctctatcctg 


tatttgtcgt 


atctttacta 


aatatatcgg 


ataacggaat 



tc 



SEQ ID NO: 2 

EBOGP1 /MBGGP2 amino acid sequence 

MetGlyValThrGlylleLeuGlnLeuProArgAspArgPheLys 

ArgThr Ser PhePheLeuTrpVal I 1 el leLeuPheGlnArgThr Phe 

SerlleProLeuGlyVallleHisAsnSerThrLeuGlnValSerAsp 

ValAspLysLeuValCysArgAspLysLeuSerSerThrAsnGlnLeuArg 

SerValGlyLeuAsnLeuGluGlyAsnGlyValAlaThrAspValProSer 

AlaThrLysArgTrpGlyPheArgSerGlyValProProLysValVal 

AsnTyrGluAlaGlyGluTrpAlaGluAsnCysTyrAsnLeuGluIleLys 

LysProAspGlySerGluCysLeuProAlaAlaProAspGlylleArgGly 
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PheProArgCysArgTyrValHisLysValSerGlyThrGlyProCys 

AlaGlyAspPheAlaPheHisLysGluGlyAlaPhePheLeuTyrAspArg 

LexiAlaSerThrVallleTyrArgGlyThrThrPheAlaGluGlyValVal 

AlaPheLeuIleLeuProGlnAlaLysLysAspPhePheSerSerHis 

ProLenArgGluProValAsnAlaThrGluAspProSerSerGlyTyrTyr 

SerThrThrlleArgTyrGlnAlaThrGlyPheGlyThrAsnGluThrGlu 

TyrLeuPheGluValAspAsnLeuThrTyrValGlnLeuGluSerArg 

PheThrProGlnPheLeuLeuGlnLeiiAsnGluThrlleTyrThrSerGly 

LysArgSerAsnThrThrGlyLysLeuIleTrpLysValAsnProGluIle 

AspThrThrlleGlyGluTrpAlaPheTrpGluThrLysLysAsnLeu 

ThrArgLysIleArgSerGluGluLeuSerPheThrValValSerAsnGly 

AlaLysAsnlleSerGlyGlnSerProAlaArgThrSerSerAspProGly 

ThrAsnThrThrThrGluAspHisLysIleMetAlaSerGluAsnSer 

SerAlaMetValGlnValHisSerGlnGlyArgGluAlaAlaValSerHis 

LeuThrThrLeuAlaThrlleSerThrSerProGlnSerLeuThrThrLys 

ProGlyProAspAsnSerThrHisAsnThrProValTyrLysLeuAsp 

IleSerGluAlaThrGlnValGluGlnHisHisArgArgThrAspAsnAsp 

SerThrAlaSerAspThrProSerAlaThrThrAlaAlaGlyProProLys 

AlaGluAsnThrAsnThrSerLysSerThrAspPheLeiiAspProAla 

ThrThrThrSerProGlnAsnHisSerGluThrAlaGlyAsnAsnAsnThr 

HisHisGlnAspThrGlyGluGluSerAlaSerSerGlyLysLeuGlyLeu 

I 1 eThr AsnThr I 1 e A 1 aGly Va 1 Al aG lyLeu I 1 eThr GlyG lyAr g 

ArgThrArgArgSerlleLeuTrpArgGluGlyAspMetPheProPheLeu 

AspGlyLeuIleAsnAlaProIleAspPheAspProValProAsnThrLys 

ThrllePheAspGluSerSerSerSerGlyAlaSerAlaGluGluAsp 

GlnHisAlaSerProAsnlleSerLeuThrLeuSerTyrPheProAsnlle 

AsnGluAsnThrAlaTyrSerGlyGluAsnGluAsnAspCysAspAlaGlu 
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LeuArglleTrpSerValGlnGluAspAspLeioAlaAlaGlyLeuSer 

TrpIleProPhePheGlyProGlylleGluGlyLeuTyrThrAlaValLeu 

IleLysAsnGlnAsnAsnLeuValCysArgLeuArgArgLeuAlaAsnGln 

ThrAlaLysSerLeuGluLeuLeuLeuArgValThrThrGluGluArg 

ThrPheSerLeuIleAsnArgHisAlalleAspPheLeuLeuThrArgTrp 

GlyGlyThrCysLysValLeuGlyProAspCysCysIleGlylleGluAsp 

LeuSerLysAsnlleSerGluGlnlleAspGlnlleLysLysAspGlu 

GlnLysGluGlyThrGlyTrpGlyLeuGlyGlyLysTrpTrpThrSerAsp 

TrpGlyValLeuThrAsnLeuGlylleLeuLeuLeuLeuSerlleAlaVal 

LeuIleAlaLeuSerCysIleCysArgllePheThrLysTyrlleGly* 



SEQ ID NO: 3 

MBGGPl /EB0GP2 nucleotide sequence 



10 


20 


30 


40 


50 


atgaagacca 


catgtttcct 


tatcagtctt 


atcttaattc 


aagggacaaa 


60 


70 


80 


90 


100 


aaatctcccc 


attttagaga 


tagctagtaa 


taatcaaccc 


caaaatgtgg 


110 


120 


130 


140 


150 


attcggtatg 


ctccggaact 


ctccagaaga 


cagaagacgt 


ccatctgatg 


160 


170 


180 


190 


200 


ggattcacac 


tgagtgggca 


aaaagttgct 


gattcccctt 


tggaggcatc 


210 


220 


230 


240 


250 


caagcgatgg 


gctttcagga 


caggtgtacc 


tcccaagaat 


gttgagtaca 


260 


270 


280 


290 


300 


cagaggggga 


ggaagccaaa 


acatgctaca 


atataagtgt 


aacggatccc 


310 


320 


330 


340 


350 


tctggaaaat 


ccttgctgtt 


agatcctcct 


accaacatcc 


gtgactatcc 


360 


370 


380 


390 


400 


gaaatgcaaa 


actatccatc 


atattcaagg 


tcaaaaccct 


catgcacagg 


410 


420 


430 


440 


450 


ggatcgccct 


tcatttatgg 


ggagcatttt 


ttctgtatga 


tcgcattgcc 


460 


470 


480 


490 


500 


tccacaacaa 


tgtaccgagg 


caaagtcttc 


actgaaggga 


acatagcagc 
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510 520 530 540 550 

tatgattgtc aataagacag tgcacaaaat gattttctcg cggcaaggac 

560 570 580 590 600 

aagggtaccg tcatatgaat ctgacttcta ctaataaata ttggacaagt 

610 620 630 640 650 

agtaacggaa cgcaaacgaa tgacactgga tgtttcggcg ctcttcaaga 

660 670 680 690 700 

atacaattct acaaagaacc aaacatgtgc tccgtccaaa atacctccac 

710 720 730 740 750 

cactgcccac agcccgtccg gagatcaaac tcacaagcac cccaactgat 

760 770 780 790 800 

gccaccaaac tcaataccac ggacccaagc agtgatgatg aggacctcgc 

810 820 830 840 850 

aacatccggc tcagggtccg gagaacgaga accccacaca acttctgatg 

860 870 880 890 900 

cggtcaccaa gcaagggctt tcatcaacaa tgccacccac tccctcacca 

910 920 930 940 950 

caaccaagca cgccacagca aggaggaaac aacacaaacc attcccaaga 

960 970 980 990 1000 

tgctgtgact gaactagaca aaaataacac aactgcacaa ccgtccatgc 

1010 1020 1030 1040 1050 

cccctcataa cactaccaca atctctacta acaacacctc caaacacaac 

1060 1070 1080 1090 1100 

ttcagcactc tctctgcacc attacaaaac accaccaatg acaacacaca 

1110 1120 1130 1140 1150 

gagcacaatc actgaaaatg agcaaaccag tgccccctcg ataacaaccc 

1160 1170 1180 1190 1200 

tgcctccaac gggaaatccc accacagcaa agagcaccag cagcaaaaaa 

1210 1220 1230 1240 1250 

ggccccgcca caacggcacc aaacacgaca aatgagcatt tcaccagtcc 

1260 1270 1280 1290 1300 

tccccccacc cccagctcga ctgcacaaca tcttgtatat ttcagaagaa 

1310 1320 1330 1340 1350 

agcgatcggc aattgtcaat gctcaaccca aatgcaaccc taatttacat 



1360 1370 
tactggacta ctcaggatga 



1380 
aggtgctgca 



1390 
atcggactgg 



1400 
cctggatacc 
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1 A -1 A 

1410 


1 a o a 

1420 


1 A *"> A 

1430 


1440 


1450 


atatttcggg 


ccagcagccg 


agggaattta 


catagagggg 


ctaatgcaca 


i a r a 

1460 


1470 


1480 


149 0 


1500 


atcaagatgg 


tttaatctgt 


gggttgagac 


agctggccaa 


cgagacgact 


1510 


1520 


1530 


1540 


1550 


caagctcttc 


aactgttcct 


gagagccaca 


actgagctac 


gcaccttttc 


•i c r f\ 


1570 


icon 

1580 


1590 


1600 


aatcctcaac 


cgtaaggcaa 


ttgatttctt 


gctgcagcga 


tggggcggca 


1610 


1620 


1630 


1640 


1650 


catgccacat 


tctgggaccg 


gactgctgta 


tcgaaccaca 


tgattggacc 


1660 


1670 


1680 


1690 


1700 


aagaacataa 


cagacaaaat 


tgatcagatt 


attcatgatt 


ttgttgataa 


1710 


1720 


1730 


1740 


1750 


aacccttccg 


gaccaggggg 


acaatgacaa 


ttggtggaca 


ggatggagac 


1760 


1770 


1780 


1790 


1800 


aatggatacc 


ggcaggtatt 


ggagttacag 


gcgttataat 


tgcagttatc 


1810 


1820 


1830 


1840 




gctttattct 


gtatatgcaa 


atttgtcttt 


tagttgaatt 


c 



SEQ ID NO: 4 

MBG1/EB02 amino acid sequence 

MetLysThrThrCysPheLeuIleSerLeuIleLeuIleGlnGlyThr 

LysAsnLeuProIleLeuGluIleAlaSerAsnAsnGlnProGln 

AsnValAspSerValCysSerGlyThrLeuGlnLysThrGluAspVal 

HisLeuMetGlyPheThrLeuSerGlyGlnLysValAlaAspSerProLeu 

GluAlaSerLysArgTrpAlaPheArgThrGlyValProProLysAsn 

ValGluTyrThrGluGlyGluGluAlaLysThrCysTyrAsnlleSerVal 

ThrAspProSerGlyLysSerLeuLeuLeuAspProProThrAsnlleArg 

AspTyrProLysCysLysThrlleHisHisIleGlnGlyGlnAsnPro 

HisAlaGlnGlylleAlaLeuHisLeuTrpGlyAlaPhePheLeuTyrAsp 

ArglleAlaSerThrThrMetTyrArgGlyLysValPheThrGluGlyAsn 

IleAlaAlaMetlleValAsnLysThrValHisLysMetllePheSer 
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ArgGlnGlyGlnGlyTyrArgHisMetAsnLeuThrSerThrAsnLysTyr 

TrpThrSerSerAsnGlyThrGlnThrAsnAspThrGlyCysPheGlyAla 

LeuGlnGluTyrAsnSerThrLysAsnGlnThrCysAlaProSerLys 

IleProProProLeuProThrAlaArgProGluIleLysLeuThrSerThr 

ProThrAspAlaThrLysLeuAsnThrThrAspProSerSerAspAspGlu 

AspLeuAlaThrSerGlySerGlySerGlyGluArgGluProHisThr 

ThrSerAspAlaValThrLysGlnGlyLeuSerSerThrMetProProThr 

ProSerProGlnProSerThrProGlnGlnGlyGlyAsnAsnThrAsnHis 

SerGlnAspAlaValThrGluLeuAspLysAsnAsnThrThrAlaGln 

ProSerMetProProHisAsnThrThrThrlleSerThrAsnAsnThrSer 

LysHisAsnPheSerThrLeuSerAlaProLeuGlnAsnThrThrAsnAsp 

AsnThrGlnSerThrlleThrGluAsnGluGlnThrSerAlaProSer 

1 1 eThrThrLeuProProThrGlyAsnProThrThrAlaLys SerThr Ser 

SerLysLysGlyProAlaThrThrAlaProAsnThrThrAsnGluHisPhe 

ThrSerProProProThrProSerSerThrAlaGlnHisLeuValTyr 

PheArgArgLysArgSerAlalleValAsnAlaGlnProLysCysAsnPro 

AsnLeuHisTyrTrpThrThrGlnAspGluGlyAlaAlalleGlyLeuAla 

TrpIleProTyrPheGlyProAlaAlaGluGlylleTyrlleGluGly 

LeuMetHisAsnGlnAspGlyLeuIleCysGlyLeuArgGlnLeuAlaAsn 

GluThrThrGlnAlaLeuGlnLeuPheLeuArgAlaThrThrGluLeuArg 

ThrPheSerlleLeuAsnArgLysAlalleAspPheLeuLeuGlnArg 

TrpGlyGlyThrCysHisIleLeuGlyProAspCysCysIleGluProHis 

AspTrpThrLysAsnlleThrAspLysIleAspGlnllelleHisAspPhe 

ValAspLysThrLeuProAspGlnGlyAspAsnAspAsnTrpTrpThr 

GlyTrpArgGlnTrpIleProAlaGlylleGlyValThrGlyValllelle 

AlaVallleAlaLeuPheCysIleCysLysPheValPhe^ 



9/29 



SEQ ID NO: 5 

MUSGPl /RVNGP2 nucleotide sequence 

Sequence Range: 1 to 2 046 

10 20 30 40 50 

atgaagacca catgtttcct tatcagtctt atcttaattc aagggacaaa 

60 70 80 90 100 

aaatctcccc attttagaga tagctagtaa taatcaaccc caaaatgtgg 

110 120 130 140 150 

attcggtatg ctccggaact ctccagaaga cagaagacgt ccatctgatg 

160 170 180 190 200 

ggattcacac tgagtgggca aaaagttgct gattcccctt tggaggcatc 

210 220 230 240 250 

caagcgatgg gctttcagga caggtgtacc tcccaagaat gttgagtaca 

260 270 280 290 300 

cagaggggga ggaagccaaa acatgctaca atataagtgt aacggatccc 

310 320 330 340 350 

tctggaaaat ccttgctgtt agatcctcct accaacatcc gtgactatcc 

360 370 380 390 400 

gaaatgcaaa actatccatc atattcaagg tcaaaaccct catgcacagg 

410 420 430 440 450 

ggatcgccct tcatttatgg ggagcatttt ttctgtatga tcgcattgcc 

460 470 480 490 500 

tccacaacaa tgtaccgagg caaagtcttc actgaaggga acatagcagc 

510 520 530 540 550 

tatgattgtc aataagacag tgcacaaaat gattttctcg cggcaaggac 

560 570 580 590 600 

aagggtaccg tcatatgaat ctgacttcta ctaataaata ttggacaagt 

610 620 630 640 650 

agtaacggaa cgcaaacgaa tgacactgga tgtttcggcg ctcttcaaga 

660 670 680 690 700 

atacaattct acaaagaacc aaacatgtgc tccgtccaaa atacctccac 

710 720 730 740 750 

cactgcccac agcccgtccg gagatcaaac tcacaagcac cccaactgat 

760 770 780 790 800 

gccaccaaac tcaataccac ggacccaagc agtgatgatg aggacctcgc 

810 820 830 840 850 
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aacatccggc tcagggtccg gagaacgaga accccacaca acttctgatg 

860 870 880 890 900 

cggtcaccaa gcaagggctt tcatcaacaa tgccacccac tccctcacca 

910 920 930 940 950 

caaccaagca cgccacagca aggaggaaac aacacaaacc attcccaaga 

960 970 980 990 1000 

tgctgtgact gaactagaca aaaataacac aactgcacaa ccgtccatgc 

1010 1020 1030 1040 1050 

cccctcataa cactaccaca atctctacta acaacacctc caaacacaac 

1060 1070 1080 1090 1100 

ttcagcactc tctctgcacc attacaaaac accaccaatg acaacacaca 

1110 1120 1130 1140 1150 

gagcacaatc actgaaaatg agcaaaccag tgccccctcg ataacaaccc 

1160 1170 1180 1190 1200 

tgcctccaac gggaaatccc accacagcaa agagcaccag cagcaaaaaa 

1210 1220 1230 1240 1250 

ggccccgcca caacggcacc aaacacgaca aatgagcatt tcaccagtcc 

1260 1270 1280 1290 1300 

tccccccacc cccagctcga ctgcacaaca tcttgtatat ttcagaagaa 

1310 1320 1330 1340 1350 

agcgatcgat tttctggaaa gaaggtgata tattcccgtt tttagatggg 

1360 1370 1380 1390 1400 

ttaataaata ctgaaattga ttttgatcca atcccaaaca cagaaacaat 

1410 1420 1430 1440 1450 

ctttgatgaa tctcccagct ttaatacttc aactaatgag gaacaacaca 

1460 1470 1480 1490 1500 

ctcccccgaa tatcagttta actttctctt attttcctga taaaaatgga 

1510 1520 1530 1540 1550 

gatactgcct actctgggga aaacgagaat gattgtgatg cagagttgag 

1560 1570 1580 1590 1600 

gatttggagt gtgcaggagg acgatttggc ggcagggctt agctggatac 

1610 1620 1630 1640 1650 

cattttttgg ccctggaatc gaaggactct atactgccgg tttaatcaaa 

1660 1670 1680 1690 1700 

aatcagaaca atttagtttg taggttgagg cgcttagcta atcaaactgc 

1710 1720 1730 1740 1750 
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taaatccttg 


gagctcttgt 


taagggtcac 


aaccgaggaa 


aggacatttt 


1760 


1770 


1780 


1790 


1800 


ccttaatcaa 


taggcatgca 


attgactttt 


tgcttacgag 


gtggggcgga 


1810 


1820 


1830 


1840 


1850 


acatgcaagg 


tgctaggacc 


tgattgttgc 


ataggaatag 


aagatctatc 


1860 


1870 


1880 


1890 


1900 


taaaaatatc 


tcagaacaaa 


tcgacaaaat 


cagaaaggat 


gaacaaaagg 


1910 


1920 


1930 


1940 


1950 


aggaaactgg 


ctggggtcta 


ggtggcaaat 


ggtggacatc 


tgactggggt 


1960 


1970 


1980 


1990 


2000 


gttctcacca 


atttgggcat 


cctgctacta 


ttatctatag 


ctgttctgat 


2010 


2020 


2030 


2040 




tgctctgtcc 


tgtatctgtc 


gtatcttcac 


taaatacatt 


ggatga 



SEQ ID NO: 6 

MUSGPl /RVNGP2 amino acid sequence 

MetLysThrThrCysPheLeuIleSerLeuIleLeuIleGlnGlyThr 

LysAsnLeuProIleLeuGluIleAlaSerAsnAsnGlnProGln 

AsnValAspSerValCysSerGlyThrLeuGlnLysThrGluAspVal 

HisLeuMetGlyPheThrLeuSerGlyGlnLysValAlaAspSerProLeu 

GluAlaSerLysArgTrpAlaPheArgThrGlyValProProLysAsn 

ValGluTyrThrGluGlyGluGluAlaLysThrCysTyrAsnlleSerVal 

ThrAspProSerGlyLysSerLeuLeuLeuAspProProThrAsnlleArg 

AspTyrProLysCysLysThrlleHisHisIleGlnGlyGlnAsnPro 

HisAlaGlnGlylleAlaLeuHisLeuTrpGlyAlaPhePheLeuTyrAsp 

ArglleAlaSerThrThrMetTyrArgGlyLysValPheThrGluGlyAsn 

IleAlaAlaMetlleValAsnLysThrValHisLysMetllePheSer 

ArgGlnGlyGlnGlyTyrArgHisMetAsnLeuThrSerThrAsnLysTyr 

TrpThrSerSerAsnGlyThrGlnThrAsnAspThrGlyCysPheGlyAla 

LeuGlnGluTyrAsnSerThrLysAsnGlnThrCysAlaProSerLys 

IleProProProLeuProThrAlaArgProGluIleLysLeuThrSerThr 
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ProThrAspAlaThrLysLeuAsnThrThrAspProSerSerAspAspGlu 

AspLeuAlaThrSerGlySerGlySerGlyGluArgGluProHisThr 

ThrSerAspAlaValThrLysGlnGlyLeuSerSerThrMetProProThr 

ProSerProGlnProSerThrProGlnGlnGlyGlyAsnAsnThrAsnHis 

SerGlriAspAlaValThrGluLeuAspLysAsnAsnThrThrAlaGln 

ProSerMetProProHisAsnThrThrThrlleSerThrAsnAsnThrSer 

LysHisAsnPheSerThrLeuSerAlaProLeuGlnAsnThrThrAsnAsp 

AsnThrGlnSerThrlleThrGluAsnGluGlnThrSerAlaProSer 

IleThrThrLeuProProThrGlyAsnProThrThrAlaLysSerThrSer 

SerLysLysGlyProAlaThrThrAlaProAsnThrThrAsnGluHisPhe 

ThrSerProProProThrProSerSerThrAlaGlnHisLeuValTyr 

PheArgArgLysArgSerllePheTrpLysGluGlyAspIlePheProPhe 

LeuAspGlyLeuIleAsnThrGluIleAspPheAspProIleProAsnThr 

GluThrllePheAspGluSerProSerPheAsnThrSerThrAsnGlu 

GluGlnHisThrProProAsnlleSerLeuThrPheSerTyrPheProAsp 

LysAsnGlyAspThrAlaTyrSerGlyGluAsnGluAsnAspCysAspAla 

GluLeuArglleTrpSerValGlnGluAspAspLeuAlaAlaGlyLeu 

SerTrpIleProPhePheGlyProGlylleGluGlyLeuTyrThrAlaGly 

LeuIleLysAsnGlnAsnAsnLeuValCysArgLeuArgArgLeuAlaAsn 

GlnThrAlaLysSerLeuGluLeuLeuLeuArgValThrThrGluGlu 

ArgThrPheSerLeuIleAsnArgHisAlalleAspPheLeuLeuThrArg 

TrpGlyGlyThrCysLysValLeuGlyProAspCysCysIleGlylleGlu 

AspLeuSerLysAsnlleSerGluGlnlleAspLysIleArgLysAsp 

GluGlnLysGluGluThrGlyTrpGlyLeuGlyGlyLysTrpTrpThrSer 

AspTrpGlyValLeuThrAsnLeuGlylleLeuLeuLeuLeuSerlleAla 

ValLeuIleAlaLeuSerCysIleCysArgllePheThrLysTyrlleGly^ 
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SEQ ID NO: 7 

RVNGPl /MUSGP2 nucleotide sequence 

10 20 30 40 50 

atgaagacca tatattttct gattagtctc attttaatcc aaagtataaa 

60 70 80 90 100 

aactctccct gttttagaaa ttgctagtaa cagccaacct caagatgtag 

110 120 130 140 150 

attcagtgtg ctccggaacc ctccaaaaga cagaagatgt tcatctgatg 

160 170 180 190 200 

ggatttacac tgagtgggca aaaagttgct gattcccctt tggaagcatc 

210 220 230 240 250 

taaacgatgg gctttcagga caggtgttcc tcccaagaac gttgagtata 

260 270 280 290 300 

cggaaggaga agaagccaaa acatgttaca atataagtgt aacagaccct 

310 320 330 340 350 

tctggaaaat ccttgctgct ggatcctccc agtaatatcc gcgattaccc 

360 370 380 390 400 

taaatgtaaa actgttcatc atattcaagg tcaaaaccct catgcacagg 

410 420 430 440 450 

ggattgccct ccatttgtgg ggggcatttt tcttgtatga tcgcgttgcc 

460 470 480 490 500 

tctacaacaa tgtaccgagg caaggtcttc actgaaggaa atatagcagc 

510 520 530 540 550 

tatgattgtt aataagacag ttcacagaat gattttttct aggcaaggac 

560 570 580 590 600 

aaggttatcg tcacatgaac ttgacctcca ccaataaata ttggacaagc 

610 620 630 640 650 

agcaatgaaa cgcagagaaa tgatacggga tgttttggca tcctccaaga 

660 670 680 690 700 

atacaactcc acaaacaatc aaacatgccc tccatctctt aaacctccat 

710 720 730 740 750 

ccctgcccac agtaactccg agcattcact ctacaaatac tcaaattaat 

760 770 780 790 800 

actgctaaat ctggaactat gaacccaagt agcgacgatg aggaccttat 



810 820 830 840 850 

gatttccggc tcaggatctg gagaacaggg gccccacaca actcttaatg 
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860 870 880 890 900 

tagtcactga acagaaacaa tcgtcaacaa tattgtccac tccttcacta> 

910 920 930 940 950 

catccaagca cctcacaaca tgagcaaaac agtacgaatc cttcccgaca 

960 970 980 990 1000 

tgctgtaact gagcacaatg gaaccgaccc aacaacacaa ccagcaacgc 

1010 1020 1030 1040 1050 

tcctcaacaa tactaataca actcccacct ataacactct caagtacaac 

1060 1070 1080 1090 1100 

ctcagtactc cttcccctcc aacccgcaac atcaccaata atgatacaca 

1110 1120 1130 1140 1150 

acgtgaacta gcagaaagcg aacaaaccaa tgctcagttg aacacaactc 

1160 1170 1180 1190 1200 

tagatccaac agaaaatccc accacaggac aagacaccaa cagcacaacc 

1210 1220 1230 1240 1250 

aacatcatca tgacgacatc agatataaca agcaaacacc ccacaaattc 

1260 1270 1280 1290 1300 

ttctccggat tctagtccga caacccgccc tcctatatac tttagaaaga 

1310 1320 1330 1340 1350 

aacgatcgat cctctggagg gaaggcgaca tgttcccttt tctggatggg 

1360 1370 1380 1390 1400 

ttaataaatg ctccaattga ttttgaccca gttccaaata caaaaacaat 

1410 1420 1430 1440 1450 

ctttgatgaa tcctctagtt ctggtgcctc ggctgaggaa gatcaacatg 

1460 1470 1480 1490 1500 

cctcccccaa tattagttta actttatctt attttcctaa tataaatgag 

1510 1520 1530 1540 1550 

aacactgcct actctggaga aaatgagaat gattgtgatg cagagttaag 

1560 1570 1580 1590 1600 

aatttggagc gttcaggagg atgacctggc cgcagggctc agttggatac 

1610 1620 1630 1640 1650 

cgttttttgg ccctggaatt gaaggacttt acactgctgt tttaattaaa> 

1660 1670 1680 1690 1700 

aatcaaaaca atttggtctg caggttgagg cgtctagcca atcaaactgc 

1710 1720 1730 1740 1750 

caaatccttg gaactcttat tgagagtcac aactgaggaa agaacattct 

1760 1770 1780 1790 1800 
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ccttaatcaa 


tagacatgct 


attgactttc 


tactcacaag 


atggggagga 


1810 


1820 


1830 


1840 


1850 


acatgcaaag 


tgcttggacc 


tgattgttgc 


atcgggatag 


aagacttgtc 


1860 


1870 


1880 


1890 


1900 


caaaaatatt 


tcagagcaaa 


ttgaccaaat 


taaaaaggac 


gaacaaaaag 


1910 


1920 


1930 


1940 


1950 


aggggactgg 


ttggggtctg 


ggtggtaaat 


ggtggacatc 


cgactggggt 


1960 


1970 


1980 


1990 


2000 


gttcttacta 


acttgggcat 


tttgctacta 


ttatccatag 


ctgtcttgat 


2010 


2020 


2030 


2040 


2050 


tgctctatcc 


tgtatttgtc 


gtatctttac 


taaatatatc 


ggataa 



SEQ ID NO: 8 

RVNGP1/MUSGP2 amino acid sequence 



MetLysThrlleTyrPheLeuIleSerLeuIleLeuIleGlnSerlleLys 

ThrLeuProValLeuGluIleAlaSerAsnSerGlnProGlnAspVal 

AspSerValCysSerGlyThrLeuGlnLysThrGluAspValHisLeuMet 

GlyPheThrLeuSerGlyGlnLysValAlaAspSerProLeuGliiAlaSer 

LysArgTrpAlaPheArgThrGlyValProProLysAsnValGluTyr 

ThrGluGlyGluGluAlaLysThrCysTyrAsnlleSerValThrAspPro 

SerGlyLysSerLeuLeuLeuAspProProSerAsnlleArgAspTyrPro 

LysCysLysThrValHisHisIleGlnGlyGlnAsnProHisAlaGln 

GlylleAlaLeuHisLeuTrpGlyAlaPhePheLeuTyrAspArgValAla 

SerThrThrMetTyrArgGlyLysValPheThrGluGlyAsnlleAlaAla 

MetlleValAsnLysThrValHisArgMetllePheSerArgGlnGly 

GlnGlyTyrArgHisMetAsnLeuThrSerThrAsnLysTyrTrpThrSer 

SerAsnGluThrGlnArgAsnAspThrGlyCysPheGlylleLeuGlnGlu 

TyrAsnSerThrAsnAsnGlnThrCysProProSerLeuLysProPro 

SerLeuProThrValThrProSerlleHisSerThrAsnThrGlnlleAsn 

ThrAlaLysSerGlyThrMetAsnProSerSerAspAspGluAspLeuMet 

IleSerGlySerGlySerGlyGluGlnGlyProHisThrThrLeuAsn 

ValValThrGluGlnLysGlnSerSerThrlleLeuSerThrProSerLeu 

HisProSerThrSerGlnHisGluGlnAsnSerThrAsnProSerArgHis 

AlaValThrGluHisAsnGlyThrAspProThrThrGlnProAlaThr 
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LeuLeuAsnAsnThrAsnThrThrProThrTyrAsnThrLeuLysTyrAsn 

LeuSerThrProSerProProThrArgAsnlleThrAsnAsnAspThrGln 

ArgGluLeuAlaGluSerGluGlnThrAsnAlaGlnLeuAsnThrThr 

LeuAspProThrGluAsnProThrThrGlyGlnAspThrAsnSerThrThr 

AsnllelleMetThrThrSerAspIleThrSerLysHisProThrAsnSer 

SerProAspSerSerProThrThrArgProProIleTyrPheArgLys 

LysArgSerlleLeuTrpArgGluGlyAspMetPheProPheLeuAspGly 

Leu 1 1 e AsnAl aPr o 1 1 e AspPheAspP r oVa 1 P r OAs nThr Ly s Thr lie 

PheAspGluSerSerSerSerGlyAlaSerAlaGluGluAspGlnHis 

AlaSerProAsnlleSerLeuThrLeuSerTyrPheProAsnlleAsnGlu 

AsnThrAlaTyrSerGlyGluAsnGluAsnAspCysAspAlaGluLeuArg 

IleTrpSerValGlnGluAspAspLeuAlaAlaGlyLeuSerTrpIle 

ProPhePheGlyProGlylleGluGlyLeuTyrThrAlaValLeuIleLys 

AsnGlnAsnAsnLeuValCysArgLeuArgArgLeuAlaAsnGlnThrAla 

LysSerLeuGluLeuLeuLeuArgValThrThrGluGluArgThrPhe 

SerLeuIleAsnArgHisAlalleAspPheLeuLeuThrArgTrpGlyGly 

ThrCysLysValLeuGlyProAspCysCysIleGlylleGluAspLeuSer 

LysAsnlleSerGluGlnlleAspGlnlleLysLysAspGluGlnLys 

GluGlyThrGlyTrpGlyLeuGlyGlyLysTrpTrpThrSerAspTrpGly 

ValLeuThrAsnLeuGlylleLeuLeuLeuLeuSerlleAlaValLeuIle 

A 1 aLeuS er Cy s I 1 eCy s Arg lie PheThr Ly s Ty r I 1 eGly * 

SEQ ID NO: 9 

MBGGP 1 / MBGGP 2 nucleotide sequence 

10 20 30 40 50 

aagcttaaca tgaagaccac atgtttcctt atcagtctta tcttaattca 

60 70 80 90 100 

agggacaaaa aatctcccca ttttagagat agctagtaat aatcaacccc 

110 120 130 140 150 

aaaatgtgga ttcggtatgc tccggaactc tccagaagac agaagacgtc 

160 170 180 190 200 

catctgatgg gattcacact gagtgggcaa aaagttgctg attccccttt 

210 220 230 240 250 
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ggaggcatcc aagcgatggg ctttcaggac aggtgtacct cccaagaatg 

260 270 280 290 300 

ttgagtacac agagggggag gaagccaaaa catgctacaa tataagtgta 

310 320 330 340 350 

acggatccct ctggaaaatc cttgctgtta gatcctccta ccaacatccg 

360 370 380 390 400 

tgactatccg aaatgcaaaa ctatccatca tattcaaggt caaaaccctc 

410 420 430 440 450 

atgcacaggg gatcgccctt catttatggg gagcattttt tctgtatgat 

460 470 480 490 500 

cgcattatgt accgaggcaa agtcttcact gaagggaaca tagcagctat 

510 520 530 540 550 

gattgtcaat aagacagtgc acaaaatgat tttctcgcgg caaggacaag 

560 570 580 590 600 

ggtaccgtca tatgaatctg acttctacta ataaatattg gacaagtagt 

610 620 630 640 650 

aacggaacgc aaacgaatga cactggatgt ttcggcgctc ttcaagaata 

660 670 680 690 700 

caattctaca aagaaccaaa catgtgctcc gtccaaaata cctccaccac 

710 720 730 740 750 

tgcccacagc ccgtccggag atcaaactca caagcacccc aactgatgcc 

760 770 780 790 800 

accaaactca ataccacgga cccaagcagt gatgatgagg acctcgcaac 

810 820 830 840 850 

atccggctca gggtccggag aacgagaacc ccacacaact tctgatgcgg 

860 870 880 890 900 

tcaccaagca agggctttca tcaacaatgc cacccactcc ctcaccacaa 

910 920 930 940 950 

ccaagcacgc cacagcaagg aggaaacaac acaaaccatt cccaagatgc 

960 970 980 990 1000 

tgtgactgaa ctagacaaaa ataacacaac tgcacaaccg tccatgcccc 

1010 1020 1030 1040 1050 

ctcataacac taccacaatc tctactaaca acacctccaa acacaacttc 

1060 1070 1080 1090 1100 

agcactctct ctgcaccatt acaaaacacc accaatgaca acacacagag 

1110 1120 1130 1140 1150 
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cacaatcact gaaaatgagc aaaccagtgc cccctcgata acaaccctgc 

1160 1170 1180 1190 1200 

ctccaacggg aaatcccacc acagcaaaga gcaccagcag caaaaaaggc 

1210 1220 1230 1240 1250 

cccgccacaa cggcaccaaa cacgacaaat gagcatttca ccagtcctcc 

1260 1270 1280 1290 1300 

ccccaccccc agctcgactg cacaacatct tgtatatttc agaagaaagc 

1310 1320 1330 1340 1350 

gatcgatcct ctggagggaa ggcgacatgt tcccttttct ggatgggtta 

1360 1370 1380 1390 1400 

ataaatgctc caattgattt tgacccagtt ccaaatacaa aaacaatctt 

1410 1420 1430 1440 1450 

tgatgaatcc tctagttctg gtgcctcggc tgaggaagat caacatgcct 

1460 1470 1480 1490 1500 

cccccaatat tagtttaact ttatcttatt ttcctaatat aaatgagaac 

1510 1520 1530 1540 1550 

actgcctact ctggagaaaa tgagaatgat tgtgatgcag agttaagaat 

1560 1570 1580 1590 1600 

ttggagcgtt caggaggatg acctggccgc agggctcagt tggataccgt 

1610 1620 1630 1640 1650 

tttttggccc tggaattgaa ggactttaca ctgctgtttt aattaaaaat 

1660 1670 1680 1690 1700 

caaaacaatt tggtctgcag gttgaggcgt ctagccaatc aaactgccaa 

1710 1720 1730 1740 1750 

atccttggaa ctcttattga gagtcacaac tgaggaaaga acattctcct 

1760 1770 1780 1790 1800 

taatcaatag acatgctatt gactttctac tcacaagatg gggaggaaca 

1810 1820 1830 1840 1850 

tgcaaagtgc ttggacctga ttgttgcatc gggatagaag acttgtccaa 

1860 1870 1880 1890 1900 

aaatatttca gagcaaattg accaaattaa aaaggacgaa caaaaagagg 

1910 1920 1930 1940 1950 

ggactggttg gggtctgggt ggtaaatggt ggacatccga ctggggtgtt 

1960 1970 1980 1990 2000 

cttactaact tgggcatttt gctactatta tccatagctg tcttgattgc 

2010 2020 2030 2040 2050 
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tctatcctgt atttgtcgta tctttactaa atatatcgga taacggaatt 
c 

SEQ ID NO: 10 

MBGGPl /MBGGP2 amino acid sequence 

MetLysThrThrCysPheLeuIleSerLeuIleLeuIleGlnGlyThr 

LysAsnLeuProIleLeuGluIleAlaSerAsnAsnGlnProGln 

AsnValAspSerValCysSerGlyThrLeuGlnLysThrGluAspVal 

HisLeuMetGlyPheThrLeuSerGlyGlnLysValAlaAspSerProLeu 

GluAlaSerLysArgTrpAlaPheArgThrGlyValProProLysAsn 

ValGluTyrThrGluGlyGluGluAlaLysThrCysTyrAsnlleSerVal 

ThrAspProSerGlyLysSerLeuLeuLeuAspProProThrAsnlleArg 

AspTyrProLysCysLysThrlleHisHisIleGlnGlyGlnAsnPro 

HisAlaGlnGlylleAlaLeuHisLeuTrpGlyAlaPhePheLeuTyrAsp 

ArglleAlaSerThrThrMetTyrArgGlyLysValPheThrGluGlyAsn 

IleAlaAlaMetlleValAsnLysThrValHisLysMetllePheSer 

ArgGlnGlyGlnGlyTyrArgHisMetAsnLeuThrSerThrAsnLysTyr 

TrpThrSerSerAsnGlyThrGlnThrAsnAspThrGlyCysPheGlyAla 

LeuGlnGluTyrAsnSerThrLysAsnGlnThrCysAlaProSerLys 

IleProProProLeuProThrAlaArgProGluIleLysLeuThrSerThr 

ProThrAspAlaThrLysLeuAsnThrThrAspProSerSerAspAspGlu 

AspLeuAlaThrSerGlySerGlySerGlyGluArgGluProHisThr 

ThrSerAspAlaValThrLysGlnGlyLeuSerSerThrMetProProThr 

ProSerProGlnProSerThrProGlnGlnGlyGlyAsnAsnThrAsnHis 

SerGlnAspAlaValThrGluLeuAspLysAsnAsnThrThrAlaGln 

ProSerMetProProHisAsnThrThrThrlleSerThrAsnAsnThrSer 

LysHisAsnPheSerThrLeuSerAlaProLeuGlnAsnThrThrAsnAsp 

AsnThrGlnSerThrlleThrGluAsnGluGlnThrSerAlaProSer 
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IleThrThrLeuProProThrGlyAsnProThrThrAlaLysSerThrSer 

SerLysLysGlyProAlaThrThrAlaProAsnThrThrAsnGluHisPhe 

ThrSerProProProThrProSerSerThrAlaGlnHisLeuValTyr 

PheArgArgLysArgSerlleLeuTrpArgGluGlyAspMetPheProPhe 

LeuAspGlyLeuIleAsnAlaProIleAspPheAspProValProAsnThr 

LysThrllePheAspGluSerSerSerSerGlyAlaSerAlaGluGlu 

AspGlnHisAlaSerProAsnlleSerLeuThrLeuSerTyrPheProAsn 

1 1 e As nGluAsnThr Al aTy r S e r G lyG luAsnG luAsnAspCy s AspAl a 

GluLenArglleTrpSerValGlnGluAspAspLeuAlaAlaGlyLeu 

SerTrpIleProPhePheGlyProGlylleGluGlyLeuTyrThrAlaVal 

LeuIleLysAsnGlnAsnAsnLeuValCysArgLeuArgArgLeuAlaAsn 

GlnThrAlaLysSerLeuGluLeuLeuLeuArgValThrThrGluGlu 

Ar gThr Phe S e r Leu 1 1 e AsnAr gH i s A 1 a 1 1 eAsp PheLeuLeuThr Ar g 

TrpGlyGlyThrCysLysValLeuGlyProAspCysCysIleGlylleGlu 

AspLeuSerLysAsnlleSerGluGlnlleAspGlnlleLysLysAsp 

GluGlnLysGluGlyThrGlyTrpGlyLeuGlyGlyLysTrpTrpThrSer 

AspTrpGlyValLeuThrAsnLeuGlylleLeuLeuLeuLeuSerlleAla 

ValLeuI leAlaLeuSerCys I 1 eCy s Ar g lie PheThr Ly s Ty r I 1 e 

Gly* 

SEQ ID NO: 11 

EBOGP1 /EBOGP2 nucleotide sequence 

10 20 30 40 50 

atgggcgtta caggaatatt gcagttacct cgtgatcgat tcaagaggac 

60 70 80 90 100 

atcattcttt ctttgggtaa ttatcctttt ccaaagaaca ttttccatcc 

110 120 130 140 150 

cacttggagt catccacaat agcacattac aggttagtga tgtcgacaaa 

160 170 180 190 200 

ctagtttgtc gtgacaaact gtcatccaca aatcaattga gatcagttgg 
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210 220 230 240 250 

actgaatctc gaagggaatg gagtggcaac tgacgtgcca tctgcaacta 

260 270 280 290 300 

aaagatgggg cttcaggtcc ggtgtcccac caaaggtggt caattatgaa 

310 320 330 340 350 

gctggtgaat gggctgaaaa ctgctacaat cttgaaatca aaaaacctga 

360 370 380 390 400 

cgggagtgag tgtctaccag cagcgccaga cgggattcgg ggcttccccc 

410 420 430 440 450 

ggtgccggta tgtgcacaaa gtatcaggaa cgggaccgtg tgccggagac 

460 470 480 490 500 

tttgccttcc ataaagaggg tgctttcttc ctgtatgatc gacttgcttc 

510 520 530 540 550 

cacagttatc taccgaggaa cgactttcgc tgaaggtgtc gttgcatttc 

560 570 580 590 600 

tgatactgcc ccaagctaag aaggacttct tcagctcaca ccccttgaga 

610 620 630 640 650 

gagccggtca atgcaacgga ggacccgtct agtggctact attctaccac 

660 670 680 690 700 

aattagatat caggctaccg gttttggaac caatgagaca gagtacttgt 

710 720 730 740 750 

tcgaggttga caatttgacc tacgtccaac ttgaatcaag attcacacca 

760 770 780 790 800 

cagtttctgc tccagctgaa tgagacaata tatacaagtg ggaaaaggag 

810 820 830 840 850 

caataccacg ggaaaactaa tttggaaggt caaccccgaa attgatacaa 

860 870 880 890 900 

caatcgggga gtgggccttc tgggaaacta aaaaaaacct cactagaaaa 

910 920 930 940 950 

attcgcagtg aagagttgtc tttcacagtt gtatcaaacg gagccaaaaa 

960 970 980 990 1000 

catcagtggt cagagtccgg cgcgaacttc ttccgaccca gggaccaaca 

1010 1020 1030 1040 1050 

caacaactga agaccacaaa atcatggctt cagaaaattc ctctgcaatg 

1060 1070 1080 1090 1100 

gttcaagtgc acagtcaagg aagggaagct gcagtgtcgc atctaacaac 
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1110 1120 1130 1140 1150 

ccttgccaca atctccacga gtccccaatc cctcacaacc aaaccaggtc 

1160 1170 1180 1190 1200 

cggacaacag cacccataat acacccgtgt ataaacttga catctctgag 

1210 1220 1230 1240 1250 

gcaactcaag ttgaacaaca tcaccgcaga acagacaacg acagcacagc 

1260 1270 1280 1290 1300 

ctccgacact ccctctgcca cgaccgcagc cggaccccca aaagcagaga 

1310 1320 1330 1340 1350 

acaccaacac gagcaagagc actgacttcc tggaccccgc caccacaaca 

1360 1370 1380 1390 1400 

agtccccaaa accacagcga gaccgctggc aacaacaaca ctcatcacca 

1410 1420 1430 1440 1450 

agataccgga gaagagagtg ccagcagcgg gaagctaggc ttaattacca 

1460 1470 1480 1490 1500 

atactattgc tggagtcgca ggactgatca caggcgggag aagaactcga 

1510 1520 1530 1540 1550 

cgatcggcaa ttgtcaatgc tcaacccaaa tgcaacccta atttacatta 

1560 1570 1580 1590 1600 

ctggactact caggatgaag gtgctgcaat cggactggcc tggataccat 

1610 1620 1630 1640 1650 

atttcgggcc agcagccgag ggaatttaca tagaggggct aatgcacaat 

1660 1670 1680 1690 1700 

caagatggtt taatctgtgg gttgagacag ctggccaacg agacgactca 

1710 1720 1730 1740 1750 

agctcttcaa ctgttcctga gagccacaac tgagctacgc accttttcaa 

1760 1770 1780 1790 1800 

tcctcaaccg taaggcaatt gatttcttgc tgcagcgatg gggcggcaca 

1810 1820 1830 1840 1850 

tgccacattc tgggaccgga ctgctgtatc gaaccacatg attggaccaa 

1860 1870 1880 1890 1900 

gaacataaca gacaaaattg atcagattat tcatgatttt gttgataaaa 

1910 1920 1930 1940 1950 

cccttccgga ccagggggac aatgacaatt ggtggacagg atggagacaa 

1960 1970 1980 1990 2000 

tggataccgg caggtattgg agttacaggc gttataattg cagttatcgc 
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2010 2020 2030 

tttattctgt atatgcaaat ttgtctttta gttgaattc 

SEQ ID NO: 12 

EBOGP 1 / EBOGP 2 amino acid sequence 

MetGlyValThrGlylleLeuGlnLeuProArgAspArgPheLys 

ArgThrSerPhePheLeuTrpValllelleLeuPheGlnArgThrPhe 

SerlleProLeuGlyVallleHisAsnSerThrLeuGlnValSerAsp 

ValAspLysLeuValCysArgAspLysLeuSerSerThrAsnGlnLeuArg 

SerValGlyLeuAsnLeuGluGlyAsnGlyValAlaThrAspValProSer 

AlaThrLysArgTrpGlyPheArgSerGlyValProProLysValVal 

AsnTyrGluAlaGlyGluTrpAlaGluAsnCysTyrAsnLeuGluIleLys 

LysProAspGlySerGluCysLeuProAlaAlaProAspGlylleArgGly 

PheProArgCysArgTyrValHisLysValSerGlyThrGlyProCys 

AlaGlyAspPheAlaPheHisLysGluGlyAlaPhePheLeuTyrAspArg 

LeuAlaSerThrVallleTyrArgGlyThrThrPheAlaGluGlyValVal 

AlaPheLeuIleLeuProGlnAlaLysLysAspPhePheSerSerHis 

ProLeuArgGluProValAsnAlaThrGluAspProSerSerGlyTyrTyr 

SerThrThrlleArgTyrGlnAlaThrGlyPheGlyThrAsnGluThrGlu 

TyrLeuPheGluValAspAsnLeuThrTyrValGlnLeuGluSerArg 

PheThrProGlnPheLeuLeuGlnLeuAsnGluThrlleTyrThrSerGly 

LysArgSerAsnThrThrGlyLysLeuIleTrpLysValAsnProGluIle 

AspThrThrlleGlyGluTrpAlaPheTrpGluThrLysLysAsnLeu 

ThrArgLysIleArgSerGluGluLeuSerPheThrValValSerAsnGly 

AlaLysAsnlleSerGlyGlnSerProAlaArgThrSerSerAspProGly 

ThrAsnThrThrThrGluAspHi sLy s IleMetAlaSerGluAsnSer 

SerAlaMetValGlnValHisSerGlnGlyArgGluAlaAlaValSerHis 

LeuThrThrLeuAlaThrlleSerThrSerProGlnSerLeuThrThrLys 

ProGlyProAspAsnSerThrHisAsnThrProValTyrLysLeuAsp 
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IleSerGluAlaThrGlnValGluGlnHisHisArgArgThrAspAsnAsp 

SerThrAlaSerAspThrProSerAlaThrThrAlaAlaGlyProProLys 

AlaGluAsnThrAsnThrSerLysSerThrAspPheLeuAspProAla 

ThrThrThrSerProGlnAsnHisSerGluThrAlaGlyAsnAsnAsnThr 

HisHisGlnAspThrGlyGluGluSerAlaSerSerGlyLysLeuGlyLeu> 

IleThrAsnThrlleAlaGlyValAlaGlyLeuIleThrGlyGlyArg 

ArgThrArgArgSerAlalleValAsnAlaGlnProLysCysAsnProAsn 

LeuHisTyrTrpThrThrGlnAspGluGlyAlaAlaIleGlyLe\aAlaTrp 

IleProTyrPheGlyProAlaAlaGluGlylleTyrlleGluGlyLeu 

MetHisAsnGlnAspGlyLeuIleCysGlyLeuArgGlnLeuAlaAsnGlu 

ThrThrGlnAlaLeuGlnLeuPheLeuArgAlaThrThrGluLeuArgThr 

PheSerlleLeuAsnArgLysAlalleAspPheLeuLeuGlnArgTrp 

GlyGlyThrCysHisIleLeuGlyProAspCysCysIleGluProHisAsp 

TrpThrLysAsnlleThrAspLysIleAspGlnllelleHisAspPheVal 

AspLysThrLeuProAspGlnGlyAspAsnAspAsnTrpTrpThrGly 

TrpArgGlnTrpIleProAlaGlylleGlyValThrGlyValllelleAla 

VallleAlaLeu PheC ys 1 1 eCy s Ly s PheVal Phe * 

SEQ ID NO: 13 

RVNGPl /RVNGP2 nucleotide sequence 

10 20 30 40 50 

atgaagacca tatattttct gattagtctc attttaatcc aaagtataaa 

60 70 80 90 100 

aactctccct gttttagaaa ttgctagtaa cagccaacct caagatgtag 

110 120 130 140 150 

attcagtgtg ctccggaacc ctccaaaaga cagaagatgt tcatctgatg 

160 170 180 190 200 

ggatttacac tgagtgggca aaaagttgct gattcccctt tggaagcatc 

210 220 230 240 250 

taaacgatgg gctttcagga caggtgttcc tcccaagaac gttgagtata 
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260 


270 


280 


290 


300 


cggaaggaga 


agaagccaaa 


acatgttaca 


dL.aL.ddy Ly L 


Pi Pi C Pi CTPi C'C C t~ 


310 


320 


330 ' 


340 


350 


tctggaaaat 


ccttgctgct 


ggaucc lccc 


zsr*\~P\z*\~P*t rr 
dy LddLdH^-^ 


nrrra f taccc 

y y c^l twA. v^* 1 — - "o- 


360 


370 


380 


390 


400 


taaatgtaaa 


actgttcatc 


acauucaagg 






410 


420 


430 


440 


450 


ggattgccct 


ccatttgtgg 


ggggcatt tt 


LL- ULy uauy a 


frcrrattacc 


460 


470 


480 


490 


500 


tctacaacaa 


tgtaccgagg 


caag y l-l. l. l.v_ 


d Lyaayy a. a. 


atatagcagc 


510 


520 


530 


540 


550 


tatgattgtt 


aataagacag 


E Lcacay aau 


era f- 1 f- 1 tt r t~ 


a crcrcaacrcrac 


560 


570 


580 


590 


600 


aaggttatcg 


tcacatgaac 


ttgacctcca 


ccaataaata 


cuyy dCddy l. 


610 


620 


630 


640 


650 


agcaatgaaa 


cgcagagaaa 


tgatacggga 


tgtt t tggca 


UtO L L- tctcty d 


660 


670 


680 


690 


700 


atacaactcc 


acaaacaatc 


aaacatgccc 


4— ?a +■-/"■•« 4— y~ < 4— 4- 

tCCaCCLCt.L 




710 


720 


730 


740 


750 


ccctgcccac 


agtaactccg 


agcattcact 


ctiacaaauac 


LLaaaL L. dd L. 


760 


770 


780 


790 


800 


actgctaaat 


ctggaactat 


gaacccaagt 


agegacgatg 


aggacc uua l. 


810 


820 


830 


840 


850 


gatttccggc 


tcaggatctg 


gagaacaggg 




pir'tntf'PiPif'Ci 


860 


870 


880 


890 


900 


tagtcactga 


acagaaacaa 


UCg UCadCaa 




trrf f racta 


910 


920 


930 


940 


950 


catccaagca 


cctcacaaca 


tgagcaaaac 


agtacgaatc 


c u ucccgacd 


960 


970 


980 


990 


1000 


tgctgtaact 


gagcacaatg 


gaaccgaccc 


aacaacacaa 


ccagcadcgc 


1010 


1020 


1030 


1040 


1050 


tcctcaacaa 


tactaataca 


actcccacct 


ataacactct 


caagtacaac 


1060 


1070 


1080 


1090 


1100 


ctcagtactc 


cttcccctcc 


aacccgcaac 


atcaccaata 


atgatacaca 



1110 1120 1130 1140 1150 

aegtgaacta geagaaageg aacaaaccaa tgctcagttg aacacaactc 
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1160 1170 1180 1190 1200 

tagatccaac agaaaatccc accacaggac aagacaccaa cagcacaacc 

1210 1220 1230 1240 1250 

aacatcatca tgacgacatc agatataaca agcaaacacc ccacaaattc 

1260 1270 1280 1290 1300 

ttctccggat tctagtccga caacccgccc tcctatatac tttagaaaga 

1310 1320 1330 1340 1350 

aacgatcgat tttctggaaa gaaggtgata tattcccgtt tttagatggg 

1360 1370 1380 1390 1400 

ttaataaata ctgaaattga ttttgatcca atcccaaaca cagaaacaat 

1410 1420 1430 1440 1450 

ctttgatgaa tctcccagct ttaatacttc aactaatgag gaacaacaca 

1460 1470 1480 1490 1500 

ctcccccgaa tatcagttta actttctctt attttcctga taaaaatgga 

1510 1520 1530 1540 1550 

gatactgcct actctgggga aaacgagaat gattgtgatg cagagttgag 

1560 1570 1580 1590 1600 

gatttggagt gtgcaggagg acgatttggc ggcagggctt agctggatac 

1610 1620 1630 1640 1650 

cattttttgg ccctggaatc gaaggactct atactgccgg tttaatcaaa 

1660 1670 1680 1690 1700 

aatcagaaca atttagtttg taggttgagg cgcttagcta atcaaactgc 

1710 1720 1730 1740 1750 

taaatccttg gagctcttgt taagggtcac aaccgaggaa aggacatttt 

1760 1770 1780 1790 1800 

ccttaatcaa taggcatgca attgactttt tgcttacgag gtggggcgga 

1810 1820 1830 1840 1850 

acatgcaagg tgctaggacc tgattgttgc ataggaatag aagatctatc 

1860 1870 1880 1890 1900 

taaaaatatc tcagaacaaa tcgacaaaat cagaaaggat gaacaaaagg 

1910 1920 1930 1940 1950 

aggaaactgg ctggggtcta ggtggcaaat ggtggacatc tgactggggt 

I960 1970 1980 1990 2000 

gttctcacca atttgggcat cctgctacta ttatctatag ctgttctgat 

2010 2020 2030 2040 

tgctctgtcc tgtatctgtc gtatcttcac taaatacatt ggatga 
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SEQ ID NO: 14 

RVNGP1/RWGP2 amino acid sequence 

MetLysThrlleTyrPheLeuIleSerLeuIleLeuIleGlnSerlleLys 

ThrLeuProValLeuGluIleAlaSerAsnSerGlnProGlnAspVal 

AspSerValCysSerGlyThrLeuGlnLysThrGluAspValHisLeuMet 

GlyPheThrLeuSerGlyGlnLysValAlaAspSerProLeuGluAlaSer 

LysArgTrpAlaPheArgThrGlyValProProLysAsnValGluTyr 

ThrGluGlyGluGluAlaLysThrCysTyrAsnlleSerValThrAspPro 

SerGlyLysSerLeuLeuLeuAspProProSerAsnlleArgAspTyrPro 

LysCysLysThrValHisHisIleGlnGlyGlnAsnProHisAlaGln 

GlylleAlaLeuHisLeuTrpGlyAlaPhePheLeuTyrAspArgValAla 

SerThrThrMetTyrArgGlyLysValPheThrGluGlyAsnlleAlaAla 

MetlleValAsnLysThrValHisArgMetllePheSerArgGlnGly 

GlnGlyTyrArgHisMetAsnLeuThrSerThrAsnLysTyrTrpThrSer 

SerAsnGluThrGlnArgAsnAspThrGlyCysPheGlylleLeuGlnGlu 

TyrAsnSerThrAsriAsnGlnThrCysProProSerLeuLysProPro 

SerLeuProThrValThrProSerlleHisSerThrAsnThrGlnlleAsn 

ThrAlaLysSerGlyThrMetAsnProSerSerAspAspGluAspLeioMet 

IleSerGlySerGlySerGlyGluGlnGlyProHisThrThrLeuAsn 

ValValThrGluGlnLysGlnSerSerThrlleLeuSerThrProSerLeu 

HisProSerThrSerGlnHisGluGlnAsnSerThrAsnProSerArgHis 

AlaValThrGluHisAsnGlyThrAspProThrThrGlnProAlaThr 

LeuLeiaAsnAsnThrAsnThrThrProThrTyrAsnThrLeuLysTyrAsn 

LeuSerThrProSerProProThrArgAsnlleThrAsnAsnAspThrGln 

ArgGluLeuAlaGluSerGluGlnThrAsnAlaGlnLeuAsnThrThr 

LeuAspProThrGluAsnProThrThrGlyGlnAspThrAsnSerThrThr 

AsnllelleMetThrThrSerAspIleThrSerLysHisProThrAsnSer 

SerProAspSerSerProThrThrArgProProIleTyrPheArgLys 

LysArgSerllePheTrpLysGluGlyAspIlePheProPheLeuAspGly 

LeuIleAsnThrGluIleAspPheAspProIleProAsnThrGluThrlle 

PheAspGluSerProSerPheAsnThrSerThrAsnGluGluGlnHis 

ThrProProAsnlleSerLeuThrPheSerTyrPheProAspLysAsnGly 
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AspThrAlaTyrSerGlyGluAsnGluAsiiAspCysAspAlaGluLeuArg 

IleTrpSerValGlnGluAspAspLeuAlaAlaGlyLeuSerTrpIle 

ProPhePheGlyProGlylleGluGlyLeuTyrThrAlaGlyLeuIleLys 

AsnGlnAsnAsnLeuValCysArgLeuArgArgLeuAlaAsnGlnThrAla 

LysSerLeuGluLeuLeuLeuArgValThrThrGluGluArgThrPhe 

SerLeuIleAsnArgHisAlalleAspPheLeuLeuThrArgTrpGlyGly 

ThrCysLysValLeuGlyProAspCysCysIleGlylleGluAspLeuSer 

LysAsnlleSerGluGlnlleAspLysIleArgLysAspGluGlnLys 

GluGluThrGlyTrpGlyLeuGlyGlyLysTrpTrpThrSerAspTrpGly 

ValLeuThrAsnLeuGlylleLeuLeuLeuLeuSerlleAlaValLeuIle 

AlaLeuSerCysIleCysArgllePheThrLysTyrlleGly* 

SEQ ID NO: 15 

GP1 forward (containing Hindlll) EBOV (5'-CAA GCT TCA ATG GGC 
GTT ACA-3 ' ) , 

SEQ ID NO: 16 

MBGV (5'-AAG CTT AAC ATG AAG ACC ACA T-3 ' ) ; 
SEQ ID NO: 17 

GP2 forward (containing Pvul) EBOV (5'-GAC GAT CGG CAA TTG 
TCA ATG- 3 ' ) , 

SEQ ID NO: 18 

MBGV (5'-AGC GAT CGA TCC TCT GGA G -3'); 
SEQ ID NO: 19 

GP1 reverse (containing Pvul) EBOV (5'-GCC GAT CGT CGA GTT 
CTT CT-3 ' ) , 

SEQ ID NO: 20 

MBGV (5' -GAT CGA TCG CTT TCT TCT G-3 ' ) ; 
SEQ ID NO: 21 

GP2 reverse (containing EcoRI) EBOV (5'-TGA ATT CAA CTA AAA 
GAC AAA TTT G-3 ' ) , 

SEQ ID NO: 22 

MBGV (5' -CGA ATT CCG TTA TCC GAT ATA T-3') 
SEQ ID NO: 23 

GP1 forward (containing Hindlll) MBGV MUS (5' -AAG CTT AAC ATG 
AAG ACC ACA T-3 ' ) 

SEQ ID NO: 24 

MBGV RVN (5 -AAG CTT CGA CAT GAA GAC CAT AT- 3 ' ) ; 
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SEQ ID NO: 25 

GP2 forward (containing Pvul) MBGV MUS (5'-AGC GAT CGA TCC 
TCT GGA G -3 ' ) , 

SEQ ID NO: 26 

MBGV RVN ( 5 ' -AAC GAT CGA TTT TCT GGA A- 3 ' ) ; 
SEQ ID NO: 27 

GP1 reverse (containing Pvul) MBGV MUS (5' -GAT CGA TCG CTT 
TCT TCT G-3 ' ) , 

SEQ ID NO: 28 

MBGV RVN (5 '-AAA TCG ATC GTT TCT TTC TAA AG- 3 ' ) ; 
SEQ ID NO: 29 

GP2 reverse (containing EcoRI) MBGV MUS (5' -CGA ATT CCG TTA 
TCC GAT ATA T-3 ' ) , 

SEQ ID NO: 30 

MBGV RVN (5' -CGA ATT CTG TCA TCC AAT GTA T-3 ' ) . 



